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(54) Method and system for reducing multimedia conference bandwidth 



(57) A method, system and computer program 
product for reducing the digital signal processing analy- 
sis performed by a multimedia conference unit 102 to 
support a multipoint conference call is described. The 
multimedia conference unit 102 first determines which 
one of a plurality of communication units 108, 110, 112, 
113 is a dominant communication unit the other com- 
munication units being subordinate communication 

100 ^ 



units. The multimedia conference unit 102 then com- 
mands the subordinate communication units to sup- 
press a portion of their respective signals, such that the 
digital signal processing analysis performed by the mul- 
timedia conference unit 102 to support the multipoint 
conference is reduced and network bandwidth used is 
minimised. 
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Description 

[0001] The invention generally relates to the fields 
of communication systems and videoconferencing. 
More particularly, the invention is directed to a method, 
system and software for reducing the bandwidth 
required to conduct a multipoint conference. 
[0002] Telephone conferencing systems have been 
available for many years. These systems have primarily 
focused on providing audio conferencing. A typical con- 
ference includes a group of individuals who are tele- 
phonically connected into a discussion by an operator at 
a central locality. In recent years, however, the addition 
of video capabilities has greatly increased the band- 
width required to establish a multipoint audio-video con- 
ference. 

[0003] By way of example, Figure 1A illustrates a 
conventional network 10 for conducting an audio-video 
multipoint conference. The network 10 includes multiple 
personal conferencing systems (in this example, PCS 
12a, PCS 12b, PCS 12c and PCS 12d) as well as a mul- 
timedia conference unit (MCU) 14 that is coupled to 
PCSs 12a - 1 2d. In the situation where the PCSs 12a - 
12d are coupled ever a local area network (LAN), the 
network 10 is illustrative of a conventional telephony- 
over-LAN (ToL) network. A PCS may be a video tele- 
phone, telephony-enabled computer, and/or portable 
device able to send and receive to each other directly 
via network 10, which may be a LAN or a wireless net- 
work. 

[0004] Generally, the MCU 14 is capable of joining 
PCSs 12a - 1 2d (ToL users in this specific example) in 
multipoint videoconferences. During a typical multipoint 
videoconference, the MCU 14 receives all video and 
audio signals from the participating PCSs and typically 
re-transmits the mixed audio signals of participating 
PCSs and the video signal originating from the domi- 
nant or presenting PCS to all participating PCSs, includ- 
ing the presenter. As seen in the example of Fig. 1A 
(which does not show audio signals but only shows 
video signals for simplicity), supposing PCS 12a is the 
presenter, the MCU 14 receives the audio signals and 
video signals from all PCSs 12a-12d, determines PCS 
12a to be dominant, and then rebroadcasts the mixed 
audio signals and video signals originally from PCS 12a 
to PCSs 12a - 12d. In this example, eight video connec- 
tions (video signals 15a-15d from PCSs 12a-12d to 
MCU 14, and video signals 16a-19a from MCU 14 to 
PCSs 12a-12d, where video signals 16a-19a carry the 
video signals 15a sent from PCS 12a to MCU 14) 
between the MCU 14 and the PCSs 12a - 1 2d via LAN 
hub 17 are required. This conventional system requires 
the MCU 14 to perform high level digital signal process- 
ing (DSP) of the multiple received and transmitted audio 
and video signals between all parties and MCU 14 
involved in the videoconference. This high level of DSP 
analysis results in the MCU 14 being very expensive 
compared to an audio-only MCU, which is compara- 
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tively simple and inexpensive. Moreover, the use of 
eight video streams in addition to the audio connections 
normally used (not shown in Fig. 1A) results in heavy 
use of network bandwidth. 
5 [0005] Another conventional approach to videocon- 
ferencing utilizes an MCU 14 capable of providing a 
multicast stream, as shown in the example of Fig. 1 B. In 
the example of Fig. 1 B (which does not show audio sig- 
nals but only shows video signals for simplicity), sup- 
10 posing PCS 12a is presenting, MCU 14 receives the 
audio signals and video signals from all PCS 12a-12d, 
determines PCS 12a to be dominant, and then retrans- 
mits the mixed audio signals and the video signals from 
PCS 12a to PCS 12b-12d in a multicast stream. In this 
is example, the required video streams are reduced to five 
video streams (video signals 15a-15d from PCSs 12a- 
12d to MCU 14, and video multicast signal 20a from 
MCU 14 to PCSs 12b-12d, where video multicast signal 
20a carries the video signals 15a originally sent from 
20 PCS 12a to MCU 14) between the MCU 14 and the 
PCSs 12a - 1 2d via LAN hub 17, when the presenter 
PCS 12a views its own presentation. It should be noted 
that in those cases where the presenter PCS 12a 
wishes to view someone else's presentation, an addi- 
25 tional video stream (15b, shown in Fig. 1 B as a dotted 
arrow from MCU 14 to LAN hub 17. where the dotted 
arrow 15b carries the video signal of a participant, e.g. 
PCS 12b, other than the presenter) from the MCU 14 to 
the PCS 12a via LAN hub 17 is required (and video 20a 
30 to PCS 1 2a would replace video 1 5b to PCS 1 2a), for a 
total of six video streams. Although the reduction in 
video streams from MCU multicast capability reduces 
the required DSP analysis somewhat, MCU 14 must still 
be capable of processing at least five video streams, 
35 which is still very expensive in comparison to an audio- 
only MCU. 

[0006] In addition to requiring expensive processing 
power in the MCU 14, multiple audio and video streams 
for supporting the videoconference session require 
40 large network bandwidth resources, which may not be 
available in some circumstances when network traffic is 
heavy. Therefore, what is desired is an improved 
method and apparatus for reducing the bandwidth 
required to conduct a videoconference. 
45 [0007] The invention is defined in the independent 
claims, to which reference should now be made. Further 
advantageous features are detailed in the dependent 
claims. 

[0008] Broadly speaking, embodiments of the 
so invention relates to an improved method, system and 
computer program product for reducing the digital signal 
processing analysis required to support a multipoint 
conference call among a plurality of callers coupled via 
a network to a multimedia conference unit. The multime- 
55 dia conference unit first determines which caller is a 
dominant caller, the other callers being subordinate call- 
ers. The multimedia conference unit then commands 
the subordinate callers to suppress a portion of their 
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signals passed over the network. In some embodi- 
ments, the portion are video signals, and only the dom- 
inant caller transmits video signals to at least the 
subordinate callers either via the multimedia conference 
unit or directly to the subordinate callers, depending on 
whether point-to-point connection capability between 
callers exists. 

[0009] In one embodiment, the callers pass audio 
signals and video signals over the network, and the mul- 
timedia conference unit uses the audio signals to deter- 
mine which of the callers is dominant. In another 
embodiment, when the multimedia conference unit 
determines that the dominant caller has changed, the 
multimedia conference unit commands the previous 
dominant caller to stop sending video signals in the form 
of video packets and the new dominant caller, if not 
already sending video signals, to start sending video 
signals in the form of video packets. 
[00101 These and other embodiments with advan- 
tages of the present invention will become apparent 
from the following detailed description and drawings of 
embodiments thereof, in which 

Fig. 1A is a conventional network illustrating the 

number of video streams required to conduct a 

multipoint videoconference; 

Fig. IB illustrates the number of video streams 

required to conduct a multicast videoconference 

using the network shown in Fig. 1 A; 

Fig. 2 is a ToL network in accordance with an 

embodiment of the invention; 

Fig. 3A is a ToL network with unicast streams, in 

accordance with an embodiment of the present 

invention; 

Fig. 3B is a ToL network with unicast streams, in 
accordance with embodiments of the present 
invention; 

Fig. 4A is a ToL network with multicast streams, in 
accordance with alternate embodiments of the 
invention; 

Fig. 4B is a ToL network with multicast streams, in 
accordance with embodiments of the invention; 
Fig. 5A is a flowchart detailing a process for reduc- 
ing the video bandwidth required and MCU digital 
signal processing required to support a multipoint 
audio-video conference in accordance with embod- 
iments of the invention; 

Fig. 5B is a flowchart detailing a process for reduc- 
ing the video bandwidth required to support a 
multipoint audio-video conference in accordance 
with alternative embodiments of the invention; and 
Fig. 6 illustrates a typical, general-purpose compu- 
ter system suitable for implementing the present 
invention. 

[0011] This invention is generally directed to a 
method system and computer product for performing 
audio-video multipoint conferencing. More particularly, 



embodiments of the present invention reduce the band- 
width required to carry out the audio-video multipoint 
conference by suppressing the transmission of video 
signals from the listening' 1 (or subordinate) users to the 

5 MCU and by utilizing video signals transmitted from the 
"speaking" (or dominant) user to the MCU, while main- 
taining the audio transmissions from all subordinate and 
dominant users to the MCU. The present invention uti- 
lizes much less network bandwidth with the resultant 

10 concomitant reduction in video MCU complexity and 
cost. In accordance with a specific embodiment of the 
present invention, the MCU continually samples the 
audio signals from the subordinate users to determine if 
any one of them is the dominant user and/or has "taken 

75 over" the presentation and therefore has become the 
dominant user. When so determined, the MCU enables 
a video stream between it and the new dominant user to 
receive the dominant video signal from the dominant 
user and then distributes the dominant video signal to 

20 the other subordinate users. Thus, reduced video 
streams (from the users to the MCU) are required in an 
audio-video conference according to various specific 
embodiments of the present invention. In other specific 
embodiments with an MCU having multicast ability, the 

25 video streams needed can be further reduced. In still 
other specific embodiments, the MCU can set up the 
video signal as a multicast signal from the dominant 
user, thereby eliminating the need for the dominant user 
to send the signal to the MCU which re-sends it to the 

30 other users. 

[0012] By having the dominant video signal and 
none of the subordinate video signals sent to the MCU, 
the resultant savings in bandwidth on the network, e.g., 
a LAN, is substantial. Similarly, fewer video streams to 

35 the MCU reduce the DSP analysis required, so that the 
cost and complexity of the MCU is also reduced. 
[0013] The invention is described in the context of 
an audio-video multipoint conferencing system with 
telephony-enabled computers on a LAN, such as shown 

40 in Fig. 2. However, it should be noted that the invention 
can be used for other types of conferencing systems. 
Such other systems include wireless conferencing sys- 
tems in which reduction in bandwidth is considered 
important. 

45 [0014] Referring now to Fig. 2, a schematic block 
diagram of an audio-video multipoint conferencing sys- 
tem 100 of the type employed with specific embodi- 
ments of the invention, is shown. The system 100 is 
preferably an International Telecommunications Union 

so (ITU)-Telephony Standardization Sector (TSS) compli- 
ant conferencing system. In a preferred embodiment, 
the network 100 supports ITU-recommended stand- 
ards. For example, one such standard is the H.323 pro- 
tocol that covers multimedia over non-guaranteed 

55 bandwidth packet switched networks. The Internet and 
LANs using TCP/IP and SPX/IPX protocols running 
over Ethernet or Token Ring are examples of packet 
switched networks with non-guaranteed bandwidth. The 
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H.323 protocol is a set of protocols that sits on top 
TCP/IP and provides interoperability among different 
vendors and platforms of products for multimedia com- 
munication applications that will run over LANs and the 
Internet. The H.323 standard specifies call control, mul- 
timedia management, and bandwidth management for 
point-to-point and multipoint conferences. H.323 also 
specifies protocols and elements that allow communica- 
tion among LANs and other networks such as the 
PSTN. 

[0015] The system 100 includes a multimedia con- 
ferencing unit (MCU) 102 that includes a selector unit 
103, a LAN hub 106, and four personal communication 
systems (PCS) 108. 110, 112 and 114. It should be 
noted that any number of PCSs can be employed in the 
system 100 and only four are shown for the sake of clar- 
ity. 

[0016] The MCU 102 is one of the primary compo- 
nents of the system 100. It can connect to public or pri- 
vate networks 1 16 by way of communication links, such 
as an audio 1 1 8 and video 1 20 T1 . A T1 communication 
link is a traditional telephone network trunk that pro- 
vides twenty-four telephone channels. When the net- 
work interface is configured as a T1 interface, the MCU 
102 can support audio-only information as well as a mix 
of audio and video information. The MCU 102 digitally 
interfaces with the PCSs 108 - 114 by way of the U\N 
hub 106. The operation and structure the LAN hub 106 
are governed, for example, by IEEE 802. Each of the 
PCSs 108-114 includes an audio-video interface to 
enable the operators to see and hear conferees, as well 
as to be seen and heard by conferees. As stated above, 
MCU 102 includes a selector unit 103. In the described 
embodiment, the selector unit 103 is arranged to select, 
as desired, a dominant PCS. 

[0017] In the described embodiment, each of the 
PCSs 108 - 114 supports hardware and software 
required to enable an operator to participate in a video- 
conference. This includes video cameras, microphones, 
along with the required encoding, decoding, and com- 
munications software and hardware. The operator of 
any of the PCSs 108 - 1 14 can establish and connect to 
a videoconference using a graphical user interface 
(QUI) displayed on the respective PCS. Once the con- 
nection is created, video from the operator's camera is 
encoded and then transmitted to the multimedia confer- 
ence unit and current conference video can be transmit- 
ted to the PCSs 108-1 14. 

[001 8] In an audio-video conference, each conferee 
interfaces to the system 100 by way of their respective 
PCS that contains a video coder/decoder (codec) 
board. Audio-video conferences are set up according to 
the codec capabilities of the participating conferees or 
according to a minimum codec capability determined to 
be in effect for the conference. The capabilities for the 
conference can be fixed or variable. If a conferee cannot 
meet the capabilities established for the conference, 
that conferee can attend the conference in an audio 



mode only or the MCU 102 can step down the capabili- 
ties to allow the conferee to join the conference, auto- 
matically or by command. 

[0019] Referring to Fig. 2, in order to set up an 
5 audio-video conference among clients (or users or call- 
ers) A, B. C. and D operating the PCSs 108, 1 10. 112. 
and 114. respectively, each of the clients calls into the 
MCU 102 using standard H.323 call setup commands. 
In one specific embodiment of the invention, the MCU 
10 102 arbitrarily selects the first client to dial in as the 
dominant client. For this example, suppose client A on 
the PCS 108 is determined to be the dominant client 
and that clients B, C. and D are subordinate clients. In 
response to the dialing in of the dominant client A, the 
15 MCU 1 02 sends H .323 commands to clients B. C. and D 
to stop sending video signals. The transmission of audio 
packets over links 126 - 130 from the PCSs 110-114 
(associated with the clients B, C, and D, respectively) to 
the MCU 102 continues as normal. The transmission of 
20 video packets from subordinate PCSs 1 10-1 14 over the 
links 126 - 130 via LAN hub 106 to MCU 102 is thus 
suppressed. In this way, only link 124 is used to send 
video packets ("Video A" in Fig. 2) from the dominant 
PCS 108 to the MCU 102. Thus, the present invention 
25 reduces the video streams from all of the PCSs to MCU 
102 down to one video stream only from the dominant 
PCS. The audio transmissions normally used for the 
audio-video conference are not changed (or sup- 
pressed) with embodiments of the present invention, but 
30 merely sampled to determine any changes in the domi- 
nant PCS based on audio dominance (therefore, the 
below Figs, do not show the audio signals but only show 
the video streams). In accordance with specific embod- 
iments, the present invention can further reduce the 
35 number of total video streams established to the subor- 
dinate PCSs, as discussed below. 
[0020] Fig. 3A illustrates an audio-video multipoint 
conference using unicast streams, in accordance wilh 
an embodiment of the invention. After the dominant 
40 PCS video stream ("Video A" in Fig. 3A) is received by 
MCU 102, MCU 102 uses four video streams to clients 
A, B, C and D when MCU multicasting is not an option, 
for reasons related to. for example, network topology or 
architecture. In this situation, a total of five video 
45 streams between MCU 102 and clients A, B, C and D 
are required for the audio-video conference. 
[0021] If client A would like to see another client, 
such as client B, instead of itself, then additional video 
streams from PCS 108 to MCU 102 and from MCU 102 
50 to PCS 110 can be formed as shown by dotted line 
arrows labeled "Video B" between MCU 102 and LAN 
hub 106 in Fig. 3B, which illustrates other embodiments 
using unicast streams. In such situations, the dominant 
client A can see each of the conferees to which he is 
55 speaking rather than watch himself on his own monitor. 
In some embodiments where clients can have point-to- 
point connections to other clients, then at the beginning 
of a conference, the MCU 102 can randomly select one 
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user for the current dominant speaker to view, and ask 
client B, for example, to begin sending video directly 
("Video B" in Fig. 3B) to client A thereby bypassing the 
MCU 102, as illustrated in Fig. 3B by omitting the two 
dotted line arrows labeled "Video B". 
[0022] In either case for Fig. 3B, as the conference 
progresses and different users "take over", the audio 
portion of the MCU 102 can remember the previously 
dominant speaker, and ask the previously dominant 
user to send a video stream to the current dominant 
speaker. In this way, the current speaker can view the 
previous speaker instead of his own face. People 
engaged in a back-and-forth discussion during a video- 
conference would be able to see each other's reactions 
while they are talking in this manner. In this situation 
where the dominant client desires to view another client, 
a total of six video streams (including dotted line arrows 
"Video B" for the scenario where the clients do not have 
point-to-point connection capability) or four video 
streams (omitting dotted line arrows "Video B" for the 
scenario where the clients do have point-to-point con- 
nection capability) between MCU 102 and clients is 
used. 

[0023] Fig. 4A illustrates an audio-visual confer- 
ence with multicast ability, in accordance with other spe- 
cific embodiments of the invention. After the dominant 
PCS video stream (dotted line arrow "Video A" from 
LAN hub 106 to MCU 102 in Fig. 4A) is received by 
MCU 102 and if MCU multicasting is available, MCU 
1 02 sends a multicast "Video A" (dotted line arrow from 
MCU 102 to LAN hub 106) via LAN hub 106 to clients A, 
B, C and D. Thus, two video streams between MCU 102 
and LAN hub 106 are needed. In embodiments where 
clients have the capability for point-to-point connections 
(in these embodiments, dotted line arrows "Video A" 
between MCU 102 and LAN hub 106 would be omitted 
from Fig. 4A), then MCU 102 informs dominant client A 
to send multicast video directly to clients A, B, C and D. 
MCU 102 also informs clients A, B, C. and D to begin 
receiving the multicast signal of the dominant client con- 
feree (i.e., client A). Therefore, video packets from the 
dominant client A are sent directly to clients B. C, and D, 
without going through MCU 102. Only mixed audio 
packets (i.e., audio from all conferees) are sent from the 
MCU 102 to the clients A. B, C, and D. 
[0024] If dominant client A wishes to view someone 
else's presentation, then an additional video stream is 
opened by MCU 102 between dominant PCS 108 and 
the selected suborcfinate client, as shown in Fig. 4B. 
Fig. 4B therefore shows two additional video streams 
(dotted line arrows "Video B" between MCU 102 and 
LAN hub 106) than those shown in Fig. 4A The specific 
embodiments of Fig. 4B are similar to the description of 
the specific embodiments of Fig. 4A. More specifically, 
Fig. 4B shows embodiments having a MCU with multi- 
cast capability and clients without point-to-point connec- 
tion capability where the dominant client A wishes to 
see one or more other clients other than himself, such 



as for example client B (Rg. 4B with all four dotted line 
arrows). Rg. 4B further shows embodiments having a 
MCU with multicast capability and clients with point-to- 
point connection capability where the dominant client A 

5 wishes to see one or more other clients other than him- 
self (Fig. 4B omitting all four dotted line arrows). 
[0025] In the above described embodiments, as the 
conference progresses, the MCU 102 monitors the 
audio channels of all clients in the conference and if the 

w dominant client stops talking and another client takes 
over, the MCU 102 dynamically alters the video packet 
transmissions based on changes in audio dominance. 
By way of example, if client A stops talking and client D 
starts to dominate the talk, the MCU 102 will arrange to 

15 have client D send video packets to clients A, B, and C 
as the dominant video stream. In some embodiments, 
MCU 102 will instruct the now-subordinate client A to 
stop transmitting video and instruct the now-dominant 
client D to start transmitting video, and MCU 102 will re- 

20 transmit (either unicast or multicast) video from client D 
to other subordinate clients. In other embodiments 
where client point-to-point connection capability exists. 
MCU 102 will instruct the now-subordinate client A to 
stop transmitting video and instruct now-dominant client 

25 D to start transmitting video to the other now-subordi- 
nate clients. In ail embodiments, the audio stream 
remains unchanged and is monitored for changes in cli- 
ent dominance. In any case, the present invention can 
significantly reduce the digital signal processing 

30 required in the MCU by reducing the total amount of 
video streams between the MCU and the network, with 
a total of four to six such video streams in the above uni- 
cast embodiments and a total of zero to four such video 
streams in the above multicast embodiments, as dis- 

35 cussed in detail above. In addition, the network band- 
width is reduced with the specific embodiments of the 
present invention. 

[0026] Fig. 5 is a flowchart detailing a process 500 
for establishing a videoconference in accordance with 

40 specific embodiments of the invention. It should be 
noted that the process 500 is implemented in some 
embodiments in the network 100. The process 500 
begins at step 502 by all parties involved in the prospec- 
tive videoconference calling the MCU in order to estab- 

45 lish the proper video and audio channels. At step 504, 
the MCU determines which of the calling parties is the 
dominant caller. The determination of dominance can 
be accomplished in many ways, one of which is by sim- 
ply designating the caller who sends the first call to the 

so MCU as dominant. This is typical of most conference 
calls since the person requesting the conference call is 
typically the one making the initial connection. The MCU 
then determines if there is a new dominant caller and 
establishes the minimally required video packet streams 

55 (as discussed in detail above) based upon that determi- 
nation at 506. This can occur, for example, if it is deter- 
mined that the person contacting the MCU first is not the 
dominant caller as well as during setting up the corrfer- 
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ence call. This determination can be, in one embodi- 
ment, established by the MCU monitoring (e.g., 
sampling) the audio packet traffic of the users, with the 
user having the most audio packet traffic being deter- 
mined to be the dominant user. In another embodiment, 
the dominant caller can be self-identified as such to the 
MCU. 

[0027] tf the MCU has determined that the domi- 
nant user has changed to a new dominant caller, then 
the MCU in step 508 sends a command to the previous 
dominant caller (if any) and any subordinate callers 
(who may be sending video packets to the dominant 
caller via the MCU) to stop transmitting video packets to 
the MCU. In alternative embodiments, the MCU may 
instruct a subordinate caller to start transmitting video 
packets to the MCU for transmission to the dominant 
caller, should the dominant caller desire to view another 
caller rather than himself. The MCU then sends in step 
510 a command to the new dominant caller to start 
transmitting video packets to the MCU. In either case, 
the MCU in step 512 sends the dominant video stream, 
in either a multicast or a unicast mode, to at least the 
subordinate users. If it is determined in step 514 that the 
conference call is not ending, then the system proceeds 
back to step 506 to determine if the dominant caller has 
changed. If the conference call is ending, the MCU 
sends a disconnect sequence to all users at 516. 
[0028] Fig. 5B is a flowchart detailing a process for 
reducing the video bandwidth required to support a 
multipoint audio-video conference in accordance with 
alternative embodiments of the invention. The process 
550 begins at step 552 by all parties involved in the pro- 
spective videoconference calling the MCU in order to 
establish the proper video and audio channels. At step 
554, the MCU determines which of the calling parties is 
the dominant caller. The determination of dominance is 
accomplished as already discussed above. The MCU 
then determines in step 556 if there is a new dominant 
caller and arranges video streams based upon that 
determination. When the dominant caller is determined, 
the MCU sends commands to the dominant caller to 
begin transmitting point-to-point video connections 
(either unicast or multicast) to at least the subordinate 
callers and the MCU instructs at least the subordinate 
callers to being receiving video from the dominant 
caller. The dominant caller then transmits video to the 
other callers and/or itself (if no other designated non- 
dominant caller is sending video to the dominant caller). 
[0029] If the MCU has determined that the domi- 
nant user has changed to a new dominant caller, then 
the MCU in step 558 sends a command to the previous 
- dominant caller (if any) and any subordinate callers 
(which may have been sending video to the dominant 
caller) to stop transmitting video packets via their 
respective point-to-point connections. In alternative 
embodiments, the MCU may instruct a subordinate 
caller to start transmitting video packets to the MCU for 
transmission to the dominant caller, should the domi- 



nant caller desire to view another caller rather than him- 
self. The MCU then sends in step 560 a command to the 
new dominant caller to start transmitting video packets 
to at least the now-subordinate callers. In either case, 

5 the dominant caller in step 562 sends the dominant 
video stream, in either a multicast or a unicast mode, to 
at least the subordinate users. If it is determined in step 
564 that the conference call is not ending, then the sys- 
tem proceeds back to step 556 to determine if the dom- 

to inant caller has changed. If the conference call is 
ending, the MCU sends a disconnect sequence to all 
users at 566. 

[0030] Fig. 6 illustrates a typical, general-purpose 
computer system 600 suitable for implementing the 
is present invention in the form of a personal communica- 
tions system. The computer system 600 includes any 
number of processors 602 (also referred to as central 
processing units, or CPUs) that are coupled to memory 
devices including storage devices 604 (typically a read 
20 only memory, or ROM) and primary storage devices 606 
(typically a random access memory, or RAM). Compu- 
ter system 600 or, more specifically CPUs 602, may be 
arranged to support a virtual machine, as will be appre- 
ciated by those skilled in the art. As is well known in the 
25 art, ROM 604 acts to transfer data and instructions uni- 
directionally to the CPUs 602. while RAM 606 is used 
typically to transfer data and instructions in a bi-direc- 
tional manner. CPUs 602 may generally include any 
number of processors. Both primary storage devices 
30 604, 606 may include any suitable computer-readable 
media. A secondary storage medium 608, which is typ- 
ically a mass memory device, is also coupled bi-direc- 
tionally to CPUs 602 and provides additional data 
storage capacity. The mass memory device 608 is a 
35 computer-readable medium that may be used to store 
programs including computer code, data, and the like. 
Typically, mass memory device 608 is a storage 
medium such as a hard disk or a tape which is generally 
slower than primary storage devices 604, 606. Mass 
40 memory storage device 608 may take the form of a 
magnetic or paper tape reader or some other well- 
known device. It will be appreciated that the information 
retained within the mass memory device 608. may, in 
appropriate cases, be incorporated in standard fashion 
45 as part of RAM 606 as virtual memory. A specific pri- 
mary storage device 604 such as a CD-ROM may also 
pass data uni-directionally to the CPUs 602. 
[0031] CPUs 602 are also coupled to one or more 
input/output devices 610 that may include, but are not 
so limited to, devices such as video monitors, track balls, 
mice, keyboards, microphones, touch-sensitive dis- 
plays, transducer card readers, magnetic or paper tape 
readers, tablets, styluses, voice or handwriting recog- 
nizers, or other well-known input devices such as, of 
55 course, other computers. Finally, CPUs 602 optionally 
may be coupled to a computer or telecommunications 
network, e.g., an internet network or an intranet net- 
work, using a network connection as shown generally at 
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612. With such a network connection, it is contemplated 
that the CPUs 602 might receive information from the 
network, or might output information to the network in 
the course of performing the above-described method 
steps. Such information, which is often represented as a 5 
sequence of instructions to be executed using CPUs 
602, may be received from and outputted to the net- 
work, for example, in the form of a computer data signal 
embodied in a carrier wave. The above-described 
devices and materials will be familiar to those o1 skill in 70 
the computer hardware and software arts. 
[0032] While the present invention has been 
described as being used with a computer system, it 
should be appreciated that the present invention may 
generally be implemented on any suitable device capa- 15 
ble of digitizing audio and/or video signals. Specifically, 
the methods of utilizing audio signals to determine a 
dominant user can be applied to wireless conference 
systems where reducing the bandwidth is important 
without departing from the spirit or the scope of the 20 
present invention. Therefore, the present examples are 
to be considered as illustrative and not restrictive, and 
the invention is not to be limited to the details given 
herein, but may be modified within the scope of the 
appended claims. 25 

Claims 

1. A method of establishing a multipoint conference 
among a plurality of communication units (108, 1 10, 30 
112,114), wherein each of the communication units 

is coupled to a multimedia conference unit (102) via 
a network, wherein the communication units (108, 
110, 112, 114) communicate with each other by 
passing signals over the network, said method 35 
comprising: 

determining which of the plurality of communi- 
cation units (108, 1 10, 112, 1 14) is a dominant 
communication unit, the others bang desig- 40 
nated as suborcOnate communication units; 
and 

suppressing a portion of the signals passed 
through said network by the subordinate com- 
munication units. <5 

2. A method according to claim 1 , wherein the deter- 
mining is performed by the multimedia conference 
unit (102). 

so 

3. A method according to claim 1 or 2, wherein the sig- 
nals include audio signals and video signals and 
the audio signals take the form of audio packets 
and the video signals preferably take the form of 
video packets. 55 

4. A method according to any of the preceding claims 
wherein suppressing a portion of the signals com- 



prises: 

issuing a command by the multimedia confer- 
ence unit (102) to the subordinate communica- 
tion units to stop sending video packets over 
said network, such that each of the subordinate 
communication units is only receiving the dom- 
inant communication unit video packets, and 
preferably such that only the dominant commu- 
nication unit is sending video packets. 

5. A method according to any of the preceding claims, 
further comprising: 

sending video packets directly from the domi- 
nant communication unit to selected ones of 
the subordinate communication units in a multi- 
cast mode. 

6. A method according to any of the preceding claims, 
further comprising sending video packets from a 
selected subordinate communication unit directly to 
the dominant communication unit. 

7. A method according to any of the preceding claims 
wherein the initial dominant communication unit is a 
first communication unit to contact the multimedia 
conference unit (1 02). 

8. A method according to any of the preceding claims 
further comprising: 

determining if the dominant communication 
unit has changed from the first one of the com- 
munication units to a second one of the com- 
munication units; 

and if so, commanding the first communication 
unit to stop sending video signals in the form of 
video packets; and 

commanding the second communication unit to 
start sending video signals in the form of video 
packets. 

9. A method according to any of the preceding claims, 
wherein said network comprises an H.323 network 
or a wireless telephony system. 

10. A system for conducting a multimedia conference, 
comprising: 

a plurality of communication units (108, 110, 
1 12, 114). wherein each of the communication 
units provides a communication signal; 
a multimedia conference unit (102) intercon- 
necting each of the plurality of communication 
units (108,110,112,114) by way of a network, 
wherein the network carries communication 
signals provided by at least any one of the plu- 
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rality of communication units; 
a selector unit (103) coupled to the multimedia 
conference unit (102) that determines which of 
the communication units in the network repre- 
sents a dominant communication unit such that s 
the other personal communication units repre- 
sent subordinate communication units; and 
wherein the multimedia conference unit (102) 
directs the subordinate communication units to 
suppress a portion of their respective commu- 10 
nication signals and further directs the domi- 
nant communication unit to transmit its 
corresponding communication signal to each of 
the subordinate communication units, prefera- 
bly via connections to said multimedia confer- 15 
ence unit. 

11. A computer program for multimedia conferencing 
comprising: 

20 

multimedia conference unit operating instruc- 
tions, wherein the multimedia conference unit 
operating instructions determine which one of 
a plurality of communication units is a dominant 
communication unit, such that others are sub- 25 
ordinate communication units; wherein the 
multimedia conference unit operating instruc- 
tions suppress a portion of communication sig- 
nals provided by the subordinate 
communication units, preferably the video sig- so 
nals; and wherein the operating instructions 
are preferably embedded in a computer-reada- 
ble medium. 
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